首页> 外文OA文献 >Monocular Depth Estimation with Hierarchical Fusion of Dilated CNNs and Soft-Weighted-Sum Inference
【2h】

Monocular Depth Estimation with Hierarchical Fusion of Dilated CNNs and Soft-Weighted-Sum Inference

机译:扩张的CNNs与层状融合的单目深度估计   软加权和推论

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Monocular depth estimation is a challenging task in complex compositionsdepicting multiple objects of diverse scales. Albeit the recent great progressthanks to the deep convolutional neural networks (CNNs), the state-of-the-artmonocular depth estimation methods still fall short to handle such real-worldchallenging scenarios. In this paper, we propose a deep end-to-end learningframework to tackle these challenges, which learns the direct mapping from acolor image to the corresponding depth map. First, we represent monocular depthestimation as a multi-category dense labeling task by contrast to theregression based formulation. In this way, we could build upon the recentprogress in dense labeling such as semantic segmentation. Second, we fusedifferent side-outputs from our front-end dilated convolutional neural networkin a hierarchical way to exploit the multi-scale depth cues for depthestimation, which is critical to achieve scale-aware depth estimation. Third,we propose to utilize soft-weighted-sum inference instead of the hard-maxinference, transforming the discretized depth score to continuous depth value.Thus, we reduce the influence of quantization error and improve the robustnessof our method. Extensive experiments on the NYU Depth V2 and KITTI datasetsshow the superiority of our method compared with current state-of-the-artmethods. Furthermore, experiments on the NYU V2 dataset reveal that our modelis able to learn the probability distribution of depth.
机译:单眼深度估计是描述各种规模的多个物体的复杂构图中的一项艰巨任务。尽管近来有了深层卷积神经网络(CNN)的巨大进步,但最新的单眼深度估计方法仍不足以应对此类现实世界中充满挑战的场景。在本文中,我们提出了一个深层的端到端学习框架来应对这些挑战,该学习框架学习了从彩色图像到相应深度图的直接映射。首先,与基于回归的公式相比,我们将单眼深度估计表示为多类别的密集标记任务。通过这种方式,我们可以建立在诸如语义分割之类的密集标记的最新进展上。其次,我们将前端扩张式卷积神经网络的不同侧输出以分层方式融合,以利用多尺度深度线索进行深度估计,这对于实现可感知尺度的深度估计至关重要。第三,我们建议使用软加权和推理代替硬最大值,将离散化的深度得分转换为连续的深度值。因此,减少了量化误差的影响,提高了方法的鲁棒性。在NYU Depth V2和KITTI数据集上进行的大量实验表明,与当前的最新方法相比,我们的方法具有优越性。此外,对NYU V2数据集的实验表明,我们的模型能够学习深度的概率分布。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号